Multiple fundamental frequency estimation based on sparse representations in a structured dictionary
نویسندگان
چکیده
a r t i c l e i n f o a b s t r a c t Automatic transcription of polyphonic music is an important task in audio signal processing, which involves identifying the fundamental frequencies (pitches) of several notes played at a time. Its difficulty stems from the fact that harmonics of different notes tend to overlap, especially in western music. This causes a problem in assigning the harmonics to their true fundamental frequencies, and in deducing spectra of several notes from their sum. We present here a multi-pitch estimation algorithm based on sparse representations in a structured dictionary, suitable for the spectra of music signals. In the vectors of this dictionary, most of the elements are forced to be zero except the elements that represent the fundamental frequencies and their harmonics. Thanks to the structured dictionary, the algorithm does not require a diverse or a large dataset for training and is computationally more efficient than alternative methods. The performance of the proposed structured dictionary transcription system is empirically examined, and its advantage is demonstrated compared to alternative dictionary learning methods.
منابع مشابه
Compressive Parameter Estimation with Earth Mover’s Distance via K-Median Clustering
In recent years, sparsity and compressive sensing have attracted significant attention in parameter estimation tasks, including frequency estimation, delay estimation, and localization. Parametric dictionaries collect observations for a sampling of the parameter space and can yield sparse representations for the signals of interest when the sampling is su ciently dense. While this dense samplin...
متن کاملWideband DOA Estimation via Sparse Bayesian Learning over a Khatri-Rao Dictionary
This paper deals with the wideband directionof-arrival (DOA) estimation by exploiting the multiple measurement vectors (MMV) based sparse Bayesian learning (SBL) framework. First, the array covariance matrices at different frequency bins are focused to the reference frequency by the conventional focusing technique and then transformed into the vector form. Then a matrix called the Khatri-Rao di...
متن کاملCompressive parameter estimation via K-median clustering
In recent years, compressive sensing (CS) has attracted significant attention in parameter estimation tasks, including frequency estimation, time delay estimation, and localization. In order to use CS in parameter estimation, parametric dictionaries (PDs) collect observations for a sampling of the parameter space and yield sparse representations for signals of interest when the sampling is suff...
متن کاملCompressive Parameter Estimation with Emd
COMPRESSIVE PARAMETER ESTIMATION WITH EMD FEBRUARY 2014 DIAN MO B.Sc., BEIHANG UNIVERSITY M.S.E.C.E., UNIVERSITY OF MASSACHUSETTS AMHERST Directed by: Professor Marco F. Duarte In recent years, sparsity and compressive sensing have attracted significant attention in parameter estimation tasks, including frequency estimation, delay estimation, and localization. Parametric dictionaries collect si...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Digital Signal Processing
دوره 23 شماره
صفحات -
تاریخ انتشار 2013